Picture for Xikang Yang

Xikang Yang

When the Manual Lies: A Realistic Benchmark to Evaluate MCP Poisoning Attacks for LLM Agents

Add code
May 22, 2026
Viaarxiv icon

Exploiting Synergistic Cognitive Biases to Bypass Safety in LLMs

Add code
Jul 30, 2025
Viaarxiv icon

The Dark Side of Trust: Authority Citation-Driven Jailbreak Attacks on Large Language Models

Add code
Nov 18, 2024
Figure 1 for The Dark Side of Trust: Authority Citation-Driven Jailbreak Attacks on Large Language Models
Figure 2 for The Dark Side of Trust: Authority Citation-Driven Jailbreak Attacks on Large Language Models
Figure 3 for The Dark Side of Trust: Authority Citation-Driven Jailbreak Attacks on Large Language Models
Figure 4 for The Dark Side of Trust: Authority Citation-Driven Jailbreak Attacks on Large Language Models
Viaarxiv icon

Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens

Add code
Jun 19, 2024
Figure 1 for Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens
Figure 2 for Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens
Figure 3 for Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens
Figure 4 for Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens
Viaarxiv icon

Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM

Add code
May 09, 2024
Viaarxiv icon